Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 357
Filtrar
1.
Mov Disord ; 38(12): 2230-2240, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37735923

RESUMO

BACKGROUND: Spinocerebellar ataxia type 12 (SCA12) is a neurodegenerative disease caused by expansion of a CAG repeat in the PPP2R2B gene. OBJECTIVE: In this study, we tested the hypothesis that the PPP2R2B antisense (PPP2R2B-AS1) transcript containing a CUG repeat is expressed and contributes to SCA12 pathogenesis. METHODS: Expression of PPP2R2B-AS1 transcript was detected in SCA12 human induced pluripotent stem cells (iPSCs), iPSC-derived NGN2 neurons, and SCA12 knock-in mouse brains using strand-specific reverse transcription polymerase chain reaction. The tendency of expanded PPP2R2B-AS1 (expPPP2R2B-AS1) RNA to form foci, a marker of toxic processes involving mutant RNAs, was examined in SCA12 cell models by fluorescence in situ hybridization. The apoptotic effect of expPPP2R2B-AS1 transcripts on SK-N-MC neuroblastoma cells was evaluated by caspase 3/7 activity. Western blot was used to examine the expression of repeat associated non-ATG-initiated translation of expPPP2R2B-AS1 transcript in SK-N-MC cells. RESULTS: The repeat region in the PPP2R2B gene locus is bidirectionally transcribed in SCA12 iPSCs, iPSC-derived NGN2 neurons, and SCA12 mouse brains. Transfected expPPP2R2B-AS1 transcripts induce apoptosis in SK-N-MC cells, and the apoptotic effect may be mediated, at least in part, by the RNA secondary structure. The expPPP2R2B-AS1 transcripts form CUG RNA foci in SK-N-MC cells. expPPP2R2B-AS1 transcript is translated in the alanine open reading frame (ORF) via repeat-associated non-ATG translation, which is diminished by single-nucleotide interruptions within the CUG repeat and MBNL1 overexpression. CONCLUSIONS: These findings suggest that PPP2R2B-AS1 contributes to SCA12 pathogenesis and may therefore provide a novel therapeutic target for the disease. © 2023 International Parkinson and Movement Disorder Society.


Assuntos
Sequências Repetitivas de Aminoácidos , Ataxias Espinocerebelares , Transcrição Gênica , Células-Tronco Pluripotentes Induzidas , Neurônios/patologia , Apoptose/genética , Linhagem Celular , Sequências Repetitivas de Aminoácidos/genética , Proteínas de Ligação a RNA/metabolismo , Proteína Fosfatase 2/genética , Proteína Fosfatase 2/metabolismo , Técnicas de Introdução de Genes , Humanos , Animais , Camundongos , Ataxias Espinocerebelares/genética , Ataxias Espinocerebelares/fisiopatologia , RNA Antissenso/genética
2.
Am J Physiol Cell Physiol ; 323(6): C1860-C1871, 2022 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-36374170

RESUMO

Mutations in tripartite motif-containing protein 32 (TRIM32), especially in NHL repeats, have been found in skeletal muscle in patients with type 2H limb-girdle muscular dystrophy (LGMD2H). However, the roles of the NHL repeats of TRIM32 in skeletal muscle functions have not been well addressed. In the present study, to examine the functional role(s) of the TRIM32 NHL repeats in skeletal muscle, TRIM32-binding proteins in skeletal muscle were first searched using a binding assay and MALDI-TOF/TOF. Sarcoplasmic/endoplasmic reticulum Ca2+-ATPase 1a (SERCA1a) was found to be a TRIM32-binding protein. Next, a deletion mutant of TRIM32 missing the NHL repeats (NHL-Del) was expressed in mouse primary skeletal myotubes during myoblast differentiation into myotubes. Ca2+ movement in the myotubes was examined using single-cell Ca2+ imaging. Unlike wild-type (WT) TRIM32, NHL-Del did not enhance the amount of Ca2+ release from the sarcoplasmic reticulum (SR), Ca2+ release for excitation-contraction (EC) coupling, or extracellular Ca2+ entry via store-operated Ca2+ entry (SOCE). In addition, even compared with the vector control, NHL-Del resulted in reduced SOCE due to reduced expression of extracellular Ca2+ entry channels. Transmission electron microscopy (TEM) observation of the myotubes revealed that NHL-Del induced the formation of abnormal vacuoles and tubular structures in the cytosol. Therefore, by binding to SERCA1a via its NHL repeats, TRIM32 may participate in the regulation of Ca2+ movement for skeletal muscle contraction and the formation of cellular vacuoles and tubular structures in skeletal muscle. Functional defects in TRIM32 due to mutations in NHL repeats may be pathogenic toward LGMD2H.


Assuntos
Cálcio , Músculo Esquelético , Distrofia Muscular do Cíngulo dos Membros , Sequências Repetitivas de Aminoácidos , Animais , Camundongos , Cálcio/metabolismo , Fibras Musculares Esqueléticas/metabolismo , Músculo Esquelético/metabolismo , Distrofia Muscular do Cíngulo dos Membros/genética , Distrofia Muscular do Cíngulo dos Membros/metabolismo , Distrofia Muscular do Cíngulo dos Membros/patologia , Retículo Sarcoplasmático/metabolismo , ATPases Transportadoras de Cálcio do Retículo Sarcoplasmático/metabolismo , Sequências Repetitivas de Aminoácidos/genética , Sequências Repetitivas de Aminoácidos/fisiologia
3.
BMC Plant Biol ; 21(1): 247, 2021 May 31.
Artigo em Inglês | MEDLINE | ID: mdl-34059006

RESUMO

BACKGROUND: Peppers (Capsicum annuum L.) containing distinct capsaicinoids are the most widely cultivated spices in the world. However, extreme genomic diversity among species represents an obstacle to breeding pepper. RESULTS: Here, we report de novo genome assemblies of Capsicum annuum 'Early Calwonder (non-pungent, ECW)' and 'Small Fruit (pungent, SF)' along with their annotations. In total, we assembled 2.9 Gb of ECW and SF genome sequences, representing over 91% of the estimated genome sizes. Structural and functional annotation of the two pepper genomes generated about 35,000 protein-coding genes each, of which 93% were assigned putative functions. Comparison between newly and publicly available pepper gene annotations revealed both shared and specific gene content. In addition, a comprehensive analysis of nucleotide-binding and leucine-rich repeat (NLR) genes through whole-genome alignment identified five significant regions of NLR copy number variation (CNV). Detailed comparisons of those regions revealed that these CNVs were generated by intra-specific genomic variations that accelerated diversification of NLRs among peppers. CONCLUSIONS: Our analyses unveil an evolutionary mechanism responsible for generating CNVs of NLRs among pepper accessions, and provide novel genomic resources for functional genomics and molecular breeding of disease resistance in Capsicum species.


Assuntos
Capsicum/genética , Evolução Molecular , Genoma de Planta , Leucina/genética , Sequências Repetitivas de Aminoácidos/genética , Variações do Número de Cópias de DNA , Resistência à Doença/genética , Anotação de Sequência Molecular , Filogenia , Melhoramento Vegetal , Doenças das Plantas/genética , Especificidade da Espécie
4.
Nat Commun ; 12(1): 3396, 2021 06 07.
Artigo em Inglês | MEDLINE | ID: mdl-34099711

RESUMO

Amyotrophic lateral sclerosis and frontotemporal dementia are two neurodegenerative diseases with overlapping clinical features and the pathological hallmark of cytoplasmic deposits of misfolded proteins. The most frequent cause of familial forms of these diseases is a hexanucleotide repeat expansion in the non-coding region of the C9ORF72 gene that is translated into dipeptide repeat polymers. Here we show that proline/arginine repeat polymers derail protein folding by sequestering molecular chaperones. We demonstrate that proline/arginine repeat polymers inhibit the folding catalyst activity of PPIA, an abundant molecular chaperone and prolyl isomerase in the brain that is altered in amyotrophic lateral sclerosis. NMR spectroscopy reveals that proline/arginine repeat polymers bind to the active site of PPIA. X-ray crystallography determines the atomic structure of a proline/arginine repeat polymer in complex with the prolyl isomerase and defines the molecular basis for the specificity of disease-associated proline/arginine polymer interactions. The combined data establish a toxic mechanism that is specific for proline/arginine dipeptide repeat polymers and leads to derailed protein homeostasis in C9orf72-associated neurodegenerative diseases.


Assuntos
Esclerose Lateral Amiotrófica/patologia , Proteína C9orf72/genética , Dipeptídeos/metabolismo , Demência Frontotemporal/patologia , Peptidilprolil Isomerase/metabolismo , Esclerose Lateral Amiotrófica/genética , Arginina/genética , Arginina/metabolismo , Biopolímeros/metabolismo , Encéfalo/patologia , Domínio Catalítico , Cristalografia por Raios X , Expansão das Repetições de DNA , Dipeptídeos/genética , Demência Frontotemporal/genética , Humanos , Ressonância Magnética Nuclear Biomolecular , Peptidilprolil Isomerase/isolamento & purificação , Peptidilprolil Isomerase/ultraestrutura , Prolina/genética , Prolina/metabolismo , Agregados Proteicos/genética , Ligação Proteica , Dobramento de Proteína , Proteínas Recombinantes/isolamento & purificação , Proteínas Recombinantes/metabolismo , Proteínas Recombinantes/ultraestrutura , Sequências Repetitivas de Aminoácidos/genética
5.
Proc Natl Acad Sci U S A ; 118(3)2021 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-33431684

RESUMO

The HIV-1 gp41 N-heptad repeat (NHR) region of the prehairpin intermediate, which is transiently exposed during HIV-1 viral membrane fusion, is a validated clinical target in humans and is inhibited by the Food and Drug Administration (FDA)-approved drug enfuvirtide. However, vaccine candidates targeting the NHR have yielded only modest neutralization activities in animals; this inhibition has been largely restricted to tier-1 viruses, which are most sensitive to neutralization by sera from HIV-1-infected individuals. Here, we show that the neutralization activity of the well-characterized NHR-targeting antibody D5 is potentiated >5,000-fold in TZM-bl cells expressing FcγRI compared with those without, resulting in neutralization of many tier-2 viruses (which are less susceptible to neutralization by sera from HIV-1-infected individuals and are the target of current antibody-based vaccine efforts). Further, antisera from guinea pigs immunized with the NHR-based vaccine candidate (ccIZN36)3 neutralized tier-2 viruses from multiple clades in an FcγRI-dependent manner. As FcγRI is expressed on macrophages and dendritic cells, which are present at mucosal surfaces and are implicated in the early establishment of HIV-1 infection following sexual transmission, these results may be important in the development of a prophylactic HIV-1 vaccine.


Assuntos
Proteína gp41 do Envelope de HIV/imunologia , Infecções por HIV/tratamento farmacológico , Receptores de IgG/imunologia , Sequências Repetitivas de Aminoácidos/genética , Sequência de Aminoácidos , Animais , Anticorpos Neutralizantes/imunologia , Anticorpos Neutralizantes/farmacologia , Afinidade de Anticorpos , Cobaias , Anticorpos Anti-HIV/imunologia , Anticorpos Anti-HIV/farmacologia , Proteína gp41 do Envelope de HIV/genética , Infecções por HIV/genética , Infecções por HIV/imunologia , Infecções por HIV/virologia , Soropositividade para HIV/imunologia , HIV-1/efeitos dos fármacos , HIV-1/imunologia , HIV-1/patogenicidade , Humanos , Soros Imunes/imunologia , Soros Imunes/farmacologia , Imunização , Imunoglobulina G/imunologia , Sequências Repetitivas de Aminoácidos/imunologia , Internalização do Vírus/efeitos dos fármacos
6.
Arch Virol ; 166(1): 43-64, 2021 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-33052487

RESUMO

Leucine-rich repeats (LRRs) are present in over 563,000 proteins from viruses to eukaryotes. LRRs repeat in tandem and have been classified into fifteen classes in which the repeat unit lengths range from 20 to 29 residues. Most LRR proteins are involved in protein-protein or ligand interactions. The amount of genome sequence data from viruses is increasing rapidly, and although viral LRR proteins have been identified, a comprehensive sequence analysis has not yet been done, and their structures, functions, and evolution are still unknown. In the present study, we characterized viral LRRs by sequence analysis and identified over 600 LRR proteins from 89 virus species. Most of these proteins were from double-stranded DNA (dsDNA) viruses, including nucleocytoplasmic large dsDNA viruses (NCLDVs). We found that the repeating unit lengths of 11 types are one to five residues shorter than those of the seven known corresponding LRR classes. The repeating units of six types are 19 residues long and are thus the shortest among all LRRs. In addition, two of the LRR types are unique and have not been observed in bacteria, archae or eukaryotes. Conserved strongly hydrophobic residues such as Leu, Val or Ile in the consensus sequences are replaced by Cys with high frequency. Phylogenetic analysis indicated that horizontal gene transfer of some viral LRR genes had occurred between the virus and its host. We suggest that the shortening might contribute to the survival strategy of viruses. The present findings provide a new perspective on the origin and evolution of LRRs.


Assuntos
DNA/genética , Leucina/genética , Sequências Repetitivas de Aminoácidos/genética , Vírus/genética , Archaea/virologia , Bactérias/virologia , Sequência Consenso/genética , Eucariotos/virologia , Filogenia , Proteínas Virais/genética
7.
Nucleic Acids Res ; 49(D1): D412-D419, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33125078

RESUMO

The Pfam database is a widely used resource for classifying protein sequences into families and domains. Since Pfam was last described in this journal, over 350 new families have been added in Pfam 33.1 and numerous improvements have been made to existing entries. To facilitate research on COVID-19, we have revised the Pfam entries that cover the SARS-CoV-2 proteome, and built new entries for regions that were not covered by Pfam. We have reintroduced Pfam-B which provides an automatically generated supplement to Pfam and contains 136 730 novel clusters of sequences that are not yet matched by a Pfam family. The new Pfam-B is based on a clustering by the MMseqs2 software. We have compared all of the regions in the RepeatsDB to those in Pfam and have started to use the results to build and refine Pfam repeat families. Pfam is freely available for browsing and download at http://pfam.xfam.org/.


Assuntos
Biologia Computacional/estatística & dados numéricos , Bases de Dados de Proteínas , Proteínas/metabolismo , Proteoma/metabolismo , Animais , COVID-19/epidemiologia , COVID-19/prevenção & controle , COVID-19/virologia , Biologia Computacional/métodos , Epidemias , Humanos , Internet , Modelos Moleculares , Estrutura Terciária de Proteína , Proteínas/química , Proteínas/genética , Proteoma/classificação , Proteoma/genética , Sequências Repetitivas de Aminoácidos/genética , SARS-CoV-2/genética , SARS-CoV-2/fisiologia , Análise de Sequência de Proteína/métodos
8.
Proc Natl Acad Sci U S A ; 117(50): 32098-32104, 2020 12 15.
Artigo em Inglês | MEDLINE | ID: mdl-33257570

RESUMO

The deadly symptoms of malaria occur as Plasmodium parasites replicate within blood cells. Members of several variant surface protein families are expressed on infected blood cell surfaces. Of these, the largest and most ubiquitous are the Plasmodium-interspersed repeat (PIR) proteins, with more than 1,000 variants in some genomes. Their functions are mysterious, but differential pir gene expression associates with acute or chronic infection in a mouse malaria model. The membership of the PIR superfamily, and whether the family includes Plasmodium falciparum variant surface proteins, such as RIFINs and STEVORs, is controversial. Here we reveal the structure of the extracellular domain of a PIR from Plasmodium chabaudi We use structure-guided sequence analysis and molecular modeling to show that this fold is found across PIR proteins from mouse- and human-infective malaria parasites. Moreover, we show that RIFINs and STEVORs are not PIRs. This study provides a structure-guided definition of the PIRs and a molecular framework to understand their evolution.


Assuntos
Plasmodium chabaudi/ultraestrutura , Domínios Proteicos/imunologia , Proteínas de Protozoários/ultraestrutura , Sequências Repetitivas de Aminoácidos/imunologia , Antígenos de Protozoários/genética , Antígenos de Protozoários/imunologia , Antígenos de Protozoários/ultraestrutura , Dicroísmo Circular , Genoma de Protozoário/genética , Humanos , Malária/imunologia , Malária/virologia , Proteínas de Membrana/genética , Proteínas de Membrana/imunologia , Proteínas de Membrana/ultraestrutura , Família Multigênica/genética , Família Multigênica/imunologia , Filogenia , Plasmodium chabaudi/genética , Plasmodium chabaudi/imunologia , Domínios Proteicos/genética , Proteínas de Protozoários/genética , Proteínas de Protozoários/imunologia , Sequências Repetitivas de Aminoácidos/genética
9.
Acta Neuropathol Commun ; 8(1): 184, 2020 11 09.
Artigo em Inglês | MEDLINE | ID: mdl-33168090

RESUMO

A C9orf72 repeat expansion is the most common genetic cause of frontotemporal dementia (FTD) and amyotrophic lateral sclerosis. One of the suggested pathomechanisms is toxicity from dipeptide repeat proteins (DPRs), which are generated via unconventional translation of sense and antisense repeat transcripts with poly-GA, poly-GP and poly-GR being the most abundant dipeptide proteins. Animal and cellular studies highlight a neurotoxic role of poly-GR and poly-PR and to a lesser degree of poly-GA. Human post-mortem studies in contrast have been much less clear on a potential role of DPR toxicity but have largely focused on immunohistochemical methods to detect aggregated DPR inclusions. This study uses protein fractionation and sensitive immunoassays to quantify not only insoluble but also soluble poly-GA, poly-GP and poly-GR concentrations in brain homogenates of FTD patients with C9orf72 mutation across four brain regions. We show that soluble DPRs are less abundant in clinically affected areas (i.e. frontal and temporal cortices). In contrast, the cerebellum not only shows the largest DPR load but also the highest relative DPR solubility. Finally, poly-GR levels and poly-GP solubility correlate with clinical severity. These findings provide the first cross-comparison of soluble and insoluble forms of all sense DPRs and shed light on the distribution and role of soluble DPRs in the etiopathogenesis of human C9orf72-FTD.


Assuntos
Encéfalo/metabolismo , Dipeptídeos/metabolismo , Demência Frontotemporal/metabolismo , Polímeros/metabolismo , Proteínas/metabolismo , Idoso , Proteína C9orf72/genética , Expansão das Repetições de DNA/genética , Feminino , Demência Frontotemporal/genética , Humanos , Masculino , Pessoa de Meia-Idade , Sequências Repetitivas de Aminoácidos/genética , Solubilidade
10.
J Biosci ; 452020.
Artigo em Inglês | MEDLINE | ID: mdl-32713858

RESUMO

Tandemly repeated structural motifs in proteins form highly stable structural folds and provide multiple binding sites associated with diverse functional roles. The tertiary structure and function of these proteins are determined by the type and copy number of the repeating units. Each repeat type exhibits a unique pattern of intra- and inter-repeat unit interactions that is well-captured by the topological features in the network representation of protein structures. Here we present an improved version of our graph based algorithm, PRIGSA, with structure-based validation and filtering steps incorporated for accurate detection of tandem structural repeats. The algorithm integrates available knowledge on repeat families with de novo prediction to detect repeats in single monomer chains as well as in multimeric protein complexes. Three levels of performance evaluation are presented: comparison with state-of-the-art algorithms on benchmark dataset of repeat and nonrepeat proteins, accuracy in the detection of members of 13 known repeat families reported in UniProt and execution on the complete Protein Data Bank to show its ability to identify previously uncharacterized proteins. A ~3-fold increase in the coverage of the members of 13 known families and 3408 novel uncharacterized structural repeat proteins are identified on executing it on PDB. PRIGSA2 is available at http:// bioinf.iiit.ac.in/PRIGSA2/.


Assuntos
Proteínas/isolamento & purificação , Sequências Repetitivas de Aminoácidos/genética , Software , Algoritmos , Sítios de Ligação/genética , Simulação por Computador , Bases de Dados de Proteínas , Humanos , Modelos Moleculares , Proteínas/genética
11.
Clin Genet ; 98(4): 384-389, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-32639022

RESUMO

Oculo-auriculo-vertebral spectrum (OAVS) [MIM:164210], or Goldenhar syndrome, is a developmental disorder associating defects of structures derived from the first and second branchial arches. The genetic origin of OAVS is supported by the description of rare deleterious variants in a few causative genes, and several chromosomal copy number variations. We describe here a large family with eight male members affected by a mild form of the spectrum, mostly auricular defects, harboring a hemizygous ZIC3 variant detected by familial exome sequencing: c.159_161dup p.(Ala55dup), resulting in an expansion of the normal 10 consecutive alanine residues to 11 alanines. Segregation analysis shows its presence in all the affected individuals, with a recessive X-linked transmission. Whole-genome sequencing performed in another affected male allowed to exclude linkage disequilibrium between this ZIC3 variant and another potential pathogenic variant in this family. Furthermore, by screening of a cohort of 274 OAVS patients, we found 1 male patient carrying an expansion of 10 to 12 alanines, a variant previously reported in patient presenting with VACTERL. Loss-of-function variants of ZIC3 are causing heterotaxy or cardiac malformations. These alanine expansion variants could have a different impact on the protein and thereby resulting in a different phenotype within the OAVS/VACTERL.


Assuntos
Canal Anal/anormalidades , Esôfago/anormalidades , Doenças Genéticas Ligadas ao Cromossomo X/genética , Predisposição Genética para Doença , Síndrome de Goldenhar/genética , Cardiopatias Congênitas/genética , Proteínas de Homeodomínio/genética , Rim/anormalidades , Deformidades Congênitas dos Membros/genética , Coluna Vertebral/anormalidades , Traqueia/anormalidades , Fatores de Transcrição/genética , Adolescente , Adulto , Alanina/genética , Canal Anal/patologia , Região Branquial/diagnóstico por imagem , Região Branquial/patologia , Criança , Pré-Escolar , Variações do Número de Cópias de DNA/genética , Esôfago/patologia , Feminino , Doenças Genéticas Ligadas ao Cromossomo X/patologia , Síndrome de Goldenhar/patologia , Cardiopatias Congênitas/patologia , Humanos , Lactente , Rim/patologia , Deformidades Congênitas dos Membros/patologia , Mutação com Perda de Função/genética , Masculino , Sequências Repetitivas de Aminoácidos/genética , Coluna Vertebral/patologia , Traqueia/patologia , Sequenciamento Completo do Genoma , Adulto Jovem
12.
PLoS One ; 15(4): e0227464, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32240172

RESUMO

Expanded CAG nucleotide repeats are the underlying genetic cause of at least 14 incurable diseases, including Huntington's disease (HD). The toxicity associated with many CAG repeat expansions is thought to be due to the translation of the CAG repeat to create a polyQ protein, which forms toxic oligomers and aggregates. However, recent studies show that HD CAG repeats undergo a non-canonical form of translation called Repeat-associated non-AUG dependent (RAN) translation. RAN translation of the CAG sense and CUG anti-sense RNAs produces six distinct repeat peptides: polyalanine (polyAla, from both CAG and CUG repeats), polyserine (polySer), polyleucine (polyLeu), polycysteine (polyCys), and polyglutamine (polyGln). The toxic potential of individual CAG-derived RAN polypeptides is not well understood. We developed pure C. elegans protein models for each CAG RAN polypeptide using codon-varied expression constructs that preserve RAN protein sequence but eliminate repetitive CAG/CUG RNA. While all RAN polypeptides formed aggregates, only polyLeu was consistently toxic across multiple cell types. In GABAergic neurons, which exhibit significant neurodegeneration in HD patients, codon-varied (Leu)38, but not (Gln)38, caused substantial neurodegeneration and motility defects. Our studies provide the first in vivo evaluation of CAG-derived RAN polypeptides in a multicellular model organism and suggest that polyQ-independent mechanisms, such as RAN-translated polyLeu peptides, may have a significant pathological role in CAG repeat expansion disorders.


Assuntos
Doença de Huntington/genética , Neurônios Motores/metabolismo , Agregados Proteicos/genética , Expansão das Repetições de Trinucleotídeos/genética , Animais , Caenorhabditis elegans/genética , Proteínas de Caenorhabditis elegans/química , Proteínas de Caenorhabditis elegans/genética , Modelos Animais de Doenças , Humanos , Doença de Huntington/patologia , Neurônios Motores/patologia , Peptídeos/genética , Peptídeos/metabolismo , Biossíntese de Proteínas , RNA Antissenso/genética , Sequências Repetitivas de Aminoácidos/genética
13.
Proc Natl Acad Sci U S A ; 117(16): 8870-8875, 2020 04 21.
Artigo em Inglês | MEDLINE | ID: mdl-32245816

RESUMO

The ability to precisely design large proteins with diverse shapes would enable applications ranging from the design of protein binders that wrap around their target to the positioning of multiple functional sites in specified orientations. We describe a protein backbone design method for generating a wide range of rigid fusions between helix-containing proteins and use it to design 75,000 structurally unique junctions between monomeric and homo-oligomeric de novo designed and ankyrin repeat proteins (RPs). Of the junction designs that were experimentally characterized, 82% have circular dichroism and solution small-angle X-ray scattering profiles consistent with the design models and are stable at 95 °C. Crystal structures of four designed junctions were in close agreement with the design models with rmsds ranging from 0.9 to 1.6 Å. Electron microscopic images of extended tetrameric structures and ∼10-nm-diameter "L" and "V" shapes generated using the junctions are close to the design models, demonstrating the control the rigid junctions provide for protein shape sculpting over multiple nanometer length scales.


Assuntos
Modelos Moleculares , Engenharia de Proteínas/métodos , Proteínas/ultraestrutura , Sequências Repetitivas de Aminoácidos/genética , Dicroísmo Circular , Microscopia Eletrônica , Biblioteca de Peptídeos , Conformação Proteica em alfa-Hélice/genética , Dobramento de Proteína , Proteínas/química , Proteínas/genética , Espalhamento a Baixo Ângulo , Difração de Raios X
14.
Biopolymers ; 111(4): e23348, 2020 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-32031681

RESUMO

Limitations associated with immunoglobulins have motivated the search for novel binding scaffolds. Repeat proteins have emerged as one promising class of scaffolds, but often are limited to binding protein and peptide targets. An exception is the repeat proteins of the immune system, which have in recent years served as an inspiration for binding scaffolds which can bind glycans and other classes of biomolecule. Like other repeat proteins, these proteins can be very stable and have a monomeric mode of binding, with elongated and highly variable binding surfaces. The ability to target glycans and glycoproteins fill an important gap in current tools for research and biomedical applications.


Assuntos
Proteínas de Transporte/química , Sistema Imunitário/química , Imunoglobulinas/química , Engenharia de Proteínas/métodos , Sequências Repetitivas de Aminoácidos/genética , Motivos de Aminoácidos/genética , Sequência de Aminoácidos/genética , Animais , Proteínas de Transporte/genética , Proteínas de Transporte/metabolismo , Humanos , Sistema Imunitário/metabolismo , Imunoglobulinas/metabolismo , Proteínas NLR/química , Peptídeos/química , Peptídeos/metabolismo , Ligação Proteica , Domínios Proteicos/genética , Receptores Toll-Like/química
15.
J Mol Biol ; 432(2): 448-466, 2020 01 17.
Artigo em Inglês | MEDLINE | ID: mdl-31756331

RESUMO

Crosstalk exists when two or more post-translational modifications, nearby in sequence or 3D space, affect each other or a protein's interactions. Saccharomyces cerevisiae protein Npl3p has six repeats of sequence SRGG, in a disordered domain, which can carry arginine methylation and serine phosphorylation. Crosstalk of the modifications controls Npl3p interactions with nuclear import, export, and other proteins. Here, we asked whether repeated SRGG motifs existed in other S. cerevisiae proteins and whether they serve a related function. Two other proteins had multiple SRGG motifs: Nop1p (fibrillarin) and Gar1p, both nucleolar proteins, which had nine and four motifs, respectively. For Nop1p, we first showed it to be extensively methylated in vivo. We then showed that the Nop1p SRGG motif is subjected to methylation by Hmt1p, phosphorylation by Sky1p, and Glc7p dephosphorylation and that there is crosstalk whereby phosphorylation blocks methylation. This is consistent with our recent motif analysis of Hmt1p, which revealed a negative specificity for acidic residues at -1 and -2 positions. On knockout of HMT1, Nop1p-GFP localization was not typically nucleolar. Conditional two-hybrid analysis, of Nop1p with C/D box small ribonuclear proteins Nop56p and Nop58p, suggested this may be associated with decreased protein-protein interactions on loss of arginine methylation. The effect of SRGG phosphorylation on the interactions of Nop1p remains unknown yet was predicted to cause a structural disorder-to-order transition in the Nop1p N-terminal domain. The SRGG motif is one of very few examples of modification crosstalk that has related functions in multiple proteins from the same species.


Assuntos
Motivos de Aminoácidos/genética , Núcleo Celular/genética , Proteínas Cromossômicas não Histona/genética , Sequências Repetitivas de Aminoácidos/genética , Transporte Ativo do Núcleo Celular/genética , Arginina/genética , Núcleo Celular/ultraestrutura , Proteínas Cromossômicas não Histona/química , Metilação , Proteínas Nucleares/genética , Fosforilação/genética , Proteína Fosfatase 1/genética , Proteínas Serina-Treonina Quinases/genética , Proteína-Arginina N-Metiltransferases/genética , Proteínas de Ligação a RNA/genética , Proteínas Repressoras/genética , Ribonucleoproteínas Nucleolares Pequenas/genética , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética , Serina/genética
16.
Biochem J ; 477(2): 445-458, 2020 01 31.
Artigo em Inglês | MEDLINE | ID: mdl-31808794

RESUMO

The neuroblastoma breakpoint family (NBPF) consists of 24 members that play an important role in neuroblastoma and other cancers. NBPF is an evolutionarily recent gene family that encodes several repeats of Olduvai domain and an abundant N-terminal region. The function and biochemical properties of both Olduvai domain and the N-terminal region remain enigmatic. Human NBPF15 encodes a 670 AA protein consisting of six clades of Olduvai domains. In this study, we synthesized and expressed full-length NBPF15, and purified a range of NBPF15 truncations which were analyzed using dynamic light scattering (DLS), superdex200 (S200), small-angle X-ray scattering (SAXS), far-UV circular dichroism (CD) spectroscopy, transmission electron microscope (TEM), and crystallography. We found that proteins containing both the N-terminal region and Olduvai domain are heterogeneous with multiple types of aggregates, and some of them underwent a liquid-to-solid phase transition, probably because of the entanglement within the N-terminal coiled-coil. Proteins that contain only the Olduvai domain are homogeneous extended monomers, and those with the conserved clade 1 (CON1) have manifested a tendency to crystallize. We suggest that the entanglements between the mosaic disorder-ordered segments in NBPF15 N terminus have triggered the multiple types of aggregates and phase transition of NBPF15 proteins, which could be associated with Olduvai-related cognitive dysfunction diseases.


Assuntos
Disfunção Cognitiva/genética , Peptídeos e Proteínas de Sinalização Intracelular/genética , Agregados Proteicos/genética , Dicroísmo Circular , Disfunção Cognitiva/patologia , Expressão Gênica/genética , Humanos , Peptídeos e Proteínas de Sinalização Intracelular/química , Microscopia Eletrônica de Transmissão , Transição de Fase , Domínios Proteicos/genética , Sequências Repetitivas de Aminoácidos/genética , Espalhamento a Baixo Ângulo , Difração de Raios X
17.
Genome Biol Evol ; 11(11): 3159-3178, 2019 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-31589292

RESUMO

Homopolymeric amino acid repeats (AARs) like polyalanine (polyA) and polyglutamine (polyQ) in some developmental proteins (DPs) regulate certain aspects of organismal morphology and behavior, suggesting an evolutionary role for AARs as developmental "tuning knobs." It is still unclear, however, whether these are occasional protein-specific phenomena or hints at the existence of a whole AAR-based regulatory system in DPs. Using novel approaches to trace their functional and evolutionary history, we find quantitative evidence supporting a generalized, combinatorial role of AARs in developmental processes with evolutionary implications. We observe nonrandom AAR distributions and combinations in HOX and other DPs, as well as in their interactomes, defining elements of a proteome-wide combinatorial functional code whereby different AARs and their combinations appear preferentially in proteins involved in the development of specific organs/systems. Such functional associations can be either static or display detectable evolutionary dynamics. These findings suggest that progressive changes in AAR occurrence/combination, by altering embryonic development, may have contributed to taxonomic divergence, leaving detectable traces in the evolutionary history of proteomes. Consistent with this hypothesis, we find that the evolutionary trajectories of the 20 AARs in eukaryotic proteomes are highly interrelated and their individual or compound dynamics can sharply mark taxonomic boundaries, or display clock-like trends, carrying overall a strong phylogenetic signal. These findings provide quantitative evidence and an interpretive framework outlining a combinatorial system of AARs whose compound dynamics mark at the same time DP functions and evolutionary transitions.


Assuntos
Eucariotos/genética , Evolução Molecular , Genes Controladores do Desenvolvimento/genética , Filogenia , Sequências Repetitivas de Aminoácidos/genética , Animais , Eucariotos/crescimento & desenvolvimento , Genes Homeobox , Genoma , Humanos , Proteoma
18.
PLoS Comput Biol ; 15(8): e1007282, 2019 08.
Artigo em Inglês | MEDLINE | ID: mdl-31415557

RESUMO

The coding space of protein sequences is shaped by evolutionary constraints set by requirements of function and stability. We show that the coding space of a given protein family-the total number of sequences in that family-can be estimated using models of maximum entropy trained on multiple sequence alignments of naturally occuring amino acid sequences. We analyzed and calculated the size of three abundant repeat proteins families, whose members are large proteins made of many repetitions of conserved portions of ∼30 amino acids. While amino acid conservation at each position of the alignment explains most of the reduction of diversity relative to completely random sequences, we found that correlations between amino acid usage at different positions significantly impact that diversity. We quantified the impact of different types of correlations, functional and evolutionary, on sequence diversity. Analysis of the detailed structure of the coding space of the families revealed a rugged landscape, with many local energy minima of varying sizes with a hierarchical structure, reminiscent of fustrated energy landscapes of spin glass in physics. This clustered structure indicates a multiplicity of subtypes within each family, and suggests new strategies for protein design.


Assuntos
Proteínas/química , Proteínas/genética , Sequências Repetitivas de Aminoácidos/genética , Algoritmos , Sequência de Aminoácidos , Biologia Computacional , Sequência Conservada , Entropia , Evolução Molecular , Modelos Moleculares , Conformação Proteica , Dobramento de Proteína , Alinhamento de Sequência/estatística & dados numéricos , Homologia de Sequência de Aminoácidos , Termodinâmica
19.
Mol Biol (Mosk) ; 53(3): 524-528, 2019.
Artigo em Russo | MEDLINE | ID: mdl-31184618

RESUMO

Proteins with homo-repeats of more than 4 amino acid residues in length were examined to understand whether some splicing sites in pre-mRNA may be attributed to homo-repeats in human proteins. The human proteome was found to contain a total of 404 proteins with homo-repeats that account for at least one splicing site in pre-mRNA. Pre-mRNA splicing sites were more often found in the C-terminal part (67%) than in the middle orN-terminal part of a homo-repeat. Ten homo-repeats were identified to have two splicing sites per repeat. The repeats were lysine homo-repeats in all but one case.


Assuntos
Proteínas/análise , Proteínas/química , Precursores de RNA/genética , Sítios de Splice de RNA/genética , Sequências Repetitivas de Aminoácidos/genética , Humanos , Lisina/genética , Lisina/metabolismo , Proteínas/genética , Proteoma/análise , Proteoma/química , Proteoma/genética , Splicing de RNA/genética
20.
J Comput Biol ; 26(11): 1253-1261, 2019 11.
Artigo em Inglês | MEDLINE | ID: mdl-31211597

RESUMO

Gene fusion is known to be one of the mechanisms of a new gene formation. Most bioinformatics methods for studying fused genes are based on the sequence similarity search. However, if the ancestral sequences were lost during evolution or changed too much, it is impossible to detect the fusion. Previously, we have developed a method of searching for triplet periodicity (TP) change points in protein-coding sequences (CDS) and showed the possible relation of this phenomenon with gene formation as a result of fusion. In this study, we improved the TP change point detection method and studied the genes of six eukaryotic genomes. At the level of 2%-3% of the probability of type I error, TP change points were found in 20%-40% of genes. Further analysis showed that about 30% of the TP change points can be explained by amino acid repeats. Another 30% can be potentially fused genes, alignment for which was detected by the BLAST program. We believe that the rest of the results can be fused genes, the ancestral sequences for which have been lost. The method is more sensitive to TP changes and allowed us to find up to two to three times more cases of significant TP change points than our previous method.


Assuntos
Biologia Computacional/métodos , Genoma/genética , Fases de Leitura Aberta/genética , Sequências Repetitivas de Aminoácidos/genética , Animais , Eucariotos/genética , Humanos , Alinhamento de Sequência/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA